Linear predict #87

Merged
hmgaudecker merged 77 commits into main from linear-predict
Mar 18, 2026
Conversation

@hmgaudecker (Member)

Add linear Kalman predict fast path

Fixes #36.

Summary

  • Add linear_kalman_predict that uses direct matrix algebra (F @ x + c) instead of the unscented sigma-point transform, for models where all factors use linear or constant transition functions
  • maximization_inputs.py auto-selects the fast path via is_all_linear() — no API changes needed
  • Refactor likelihood functions to accept a generic predict_func callable instead of hardcoded kalman_predict + transition_func
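Conceptually, the fast path is the textbook linear Kalman prediction. A minimal sketch in plain covariance form (hypothetical names and signature; the actual skillmodels filters work in square-root form):

```python
import jax.numpy as jnp


def linear_predict_sketch(x, p, f_mat, c_vec, q):
    """Textbook linear Kalman predict (illustrative, not the skillmodels API).

    x: (n,) state mean, p: (n, n) state covariance,
    f_mat: (n, n) transition matrix, c_vec: (n,) intercept,
    q: (n, n) shock covariance.
    """
    x_new = f_mat @ x + c_vec          # mean propagates exactly
    p_new = f_mat @ p @ f_mat.T + q    # covariance propagates exactly
    return x_new, p_new
```

No sigma points are needed: for linear transitions the mean and covariance propagate in closed form.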

Benchmark results

Tested on health-cognition (no_feedback_to_investments_linear, 4 latent factors, GPU 8 GiB):

| Metric | om.Constraints (unscented) | linear-predict |
| --- | --- | --- |
| GPU: per iter (100 iters) | 8.87s | 8.36s (~6% speedup) |
| JIT warmup | 43.3s | 40.4s |
| GPU memory | higher (OOMs with ~5 GiB free) | lower (runs with ~5 GiB free) |
| CPU: per iter (10 iters) | 109.6s | 107.9s (~1.6%, within noise) |

The main benefit is reduced GPU memory usage — the unscented transform generates 2n+1 sigma points which are expensive to differentiate through, while the linear path uses a single matrix multiply. On a small model (4 factors), the speed gain is modest (~6% GPU), but the memory reduction is the difference between fitting on GPU vs OOMing when memory is constrained.
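For intuition on why the two paths agree: a standard unscented transform propagates 2n+1 sigma points, and for a linear transition their weighted mean reproduces the single matrix multiply exactly. A sketch using standard Julier sigma points and weights (illustrative only, not the skillmodels implementation):

```python
import jax.numpy as jnp


def unscented_predict_mean(x, chol_p, f_mat, c_vec, kappa=2.0):
    """Mean of 2n+1 sigma points pushed through a linear transition.

    Standard Julier sigma points/weights (illustrative). For a linear
    map the result equals f_mat @ x + c_vec exactly, which is why the
    direct matrix path can replace the unscented predict.
    """
    n = x.shape[0]
    scale = jnp.sqrt(n + kappa)
    offsets = scale * chol_p.T              # rows are scaled Cholesky columns
    points = jnp.concatenate([x[None, :], x + offsets, x - offsets])
    transformed = points @ f_mat.T + c_vec  # (2n + 1, n)
    w0 = kappa / (n + kappa)
    wi = 1.0 / (2.0 * (n + kappa))
    weights = jnp.concatenate([jnp.array([w0]), jnp.full(2 * n, wi)])
    return weights @ transformed
```

The symmetric ± offsets cancel in the weighted sum, so the linear path skips materializing (and differentiating through) the (2n+1, n) sigma-point array entirely.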

Test plan

  • Added unit tests for linear_kalman_predict and is_all_linear in test_kalman_filters.py
  • All 351 existing tests pass
  • Benchmarked against om.Constraints on real estimation task

hmgaudecker and others added 30 commits January 8, 2026 18:53
Introduce strongly-typed dataclasses for model configuration:
- Dimensions, Labels, Anchoring, EstimationOptions, TransitionInfo
- FactorEndogenousInfo, EndogenousFactorsInfo

This improves type safety and enables IDE autocompletion while keeping
user-facing model_dict as a plain dictionary.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Replace dict fields with frozendict in frozen dataclasses to ensure
true immutability:
- Labels.aug_periods_to_periods
- Labels.aug_stages_to_stages
- Anchoring.outcomes
- TransitionInfo.param_names, individual_functions, function_names
- EndogenousFactorsInfo.aug_periods_to_aug_period_meas_types, factor_info

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Update process_model() to return a ProcessedModel frozen dataclass
and update all consumers to use attribute access instead of dict access.

This provides:
- Better type safety with explicit typed fields
- Immutability via frozen dataclass
- IDE autocomplete support
- Clear documentation of the model structure

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
…c so that config.TEST_DATA_DIR is valid also for the skillmodels package (as opposed to the project).
The filtered_states DataFrame and params index both use aug_period as the
period identifier, not period. This fixes KeyError when calling
decompose_measurement_variance.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
hmgaudecker and others added 15 commits March 15, 2026 14:47
Remove list from loc type union, convert callers to tuple().
Update anchoring test expectations from list to tuple.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The viz code assumed states DataFrames always have `aug_period` as a
column, but pre-computed states (e.g. from health-cognition) may carry
`period` in the index instead. Add `_normalize_states_columns` to
promote index levels and rename `period` → `aug_period` when needed.

Also document the period vs aug_period convention in CLAUDE.md.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@codecov

codecov bot commented Mar 18, 2026

Codecov Report

❌ Patch coverage is 98.67550% with 2 lines in your changes missing coverage. Please review.
✅ Project coverage is 96.91%. Comparing base (023775e) to head (5c63203).
⚠️ Report is 1 commit behind head on main.

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| src/skillmodels/maximization_inputs.py | 60.00% | 2 Missing ⚠️ |
Additional details and impacted files
@@            Coverage Diff             @@
##             main      #87      +/-   ##
==========================================
+ Coverage   96.86%   96.91%   +0.05%     
==========================================
  Files          57       57              
  Lines        4809     4952     +143     
==========================================
+ Hits         4658     4799     +141     
- Misses        151      153       +2     


@janosg (Member) left a comment

Maybe we should compare the speed of just the update step to see if the linear one is implemented efficiently. Without a detailed analysis I would expect the linear update step to be at least twice as fast as the unscented one. Of course, in a model with few factors or many measurements per factor, the unscented predict might not have been the bottleneck anyway.

Comment on lines +298 to +309

```python
for i, factor in enumerate(latent_factors):
    if i in constant_factor_indices:
        row = jnp.zeros(n_all_factors).at[i].set(1.0)
        f_rows.append(row)
        c_vals.append(0.0)
    else:
        coeffs = trans_coeffs[factor]
        f_rows.append(coeffs[:-1])
        c_vals.append(coeffs[-1])

f_mat = jnp.stack(f_rows)  # (n_latent, n_all)
c_vec = jnp.array(c_vals)  # (n_latent,)
```
@janosg (Member)

Looks suboptimal but maybe Jax is smart enough at compiling the small array creation away. Have you tried different implementations?

@hmgaudecker (Member, Author)

Confirmed that Jax is smart enough. But I kept a more idiomatic version from the experiments and added a note.
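One way to verify claims like this is to lower a jitted version of the builder and inspect the compiled program; the Python-level list handling happens only at trace time. A sketch (hypothetical helper, not the skillmodels code):

```python
import jax
import jax.numpy as jnp


def build_f_and_c(trans_coeffs):
    # Same pattern as the snippet under discussion: stack per-factor
    # coefficient rows into a transition matrix and intercept vector.
    f_rows = [coeffs[:-1] for coeffs in trans_coeffs]
    c_vals = [coeffs[-1] for coeffs in trans_coeffs]
    return jnp.stack(f_rows), jnp.stack(c_vals)


coeffs = [jnp.array([1.0, 2.0, 3.0]), jnp.array([4.0, 5.0, 6.0])]
lowered = jax.jit(build_f_and_c).lower(coeffs)
print(lowered.as_text())  # Python lists are gone; only array ops remain
```

The lowered text shows what XLA actually receives, so list-comprehension overhead visible in the Python source does not appear there.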

@hmgaudecker (Member, Author)

hmgaudecker commented Mar 18, 2026

Re: the question about a linear update step —

The measurement model in skillmodels is always linear (states @ loadings + controls @ control_params), so kalman_update is already the exact linear update. The unscented transform only appears in the predict step (to propagate states through potentially nonlinear transition functions). There is no "unscented update" counterpart; the same kalman_update runs regardless of whether transitions are linear or nonlinear.

The QR decomposition in the update operates on an (n_states+1) × (n_states+1) matrix whose structure depends only on the measurement model (loadings, meas_sd), not on the transition model. So a linear_kalman_update would be identical to the current function — no optimization opportunity here.

(this is Claude, obviously, but it does appear plausible without checking deeply)
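For reference, the algebra that a scalar-measurement linear Kalman update computes, written in plain covariance form (names and signature are hypothetical; as noted above, the actual kalman_update uses a QR-based square-root form instead):

```python
import jax.numpy as jnp


def linear_update_sketch(x, p, y, loadings, meas_sd):
    """Covariance-form Kalman update for one scalar measurement.

    This is only the textbook algebra; the skillmodels implementation
    works on Cholesky factors via QR for numerical stability.
    """
    residual = y - loadings @ x
    f = p @ loadings                     # state/measurement covariance
    sigma2 = loadings @ f + meas_sd**2   # innovation variance (scalar)
    kalman_gain = f / sigma2
    x_new = x + kalman_gain * residual
    p_new = p - jnp.outer(kalman_gain, f)
    return x_new, p_new
```

Nothing in this algebra depends on whether the transition functions are linear, which is why there is no separate "linear update" to optimize.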

Base automatically changed from om.Constraints to main March 18, 2026 13:03
@hmgaudecker hmgaudecker merged commit 2a77ee4 into main Mar 18, 2026
6 checks passed
@hmgaudecker hmgaudecker deleted the linear-predict branch March 18, 2026 13:16
Development

Successfully merging this pull request may close these issues.

Use a linear predict when possible